Author: Aditya Sundar

Email: aditya.1094@gmail.com

Note: There is a heading before each cell or group of cells. Headings in black font denote code for data retrieval and cleaning. Headings in red font denote visualisation code that can be run directly on the processed data.

Modules

Web scraping to get data for electric vehicles (EV) sales in the USA

The data cover various types of electric vehicles from several automobile manufacturers and show purchasing trends from 2000 to 2022.

Data cleaning

The data are binned annually from 2000 to 2022. The earliest recorded year among all datasets is 2003 and the latest is 2022. For organisation into a dataframe, a year without any recorded data is assigned max(0, cumulative EV sales for the previous year): the previous year's cumulative total is carried forward, or 0 is used if there is no earlier record.
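The gap-filling rule can be illustrated with a minimal sketch. It assumes that a year without a record should inherit the previous year's cumulative total, or 0 when nothing has been recorded yet. The function name `fill_cumulative` and the sample numbers are hypothetical, and the sketch uses plain dictionaries rather than the notebook's dataframe.

```python
def fill_cumulative(recorded, first_year=2000, last_year=2022):
    """Return {year: cumulative_sales} for every year in the range.

    Assumption: a year without a record carries forward the previous
    year's cumulative value, or 0 if no earlier record exists.
    """
    filled = {}
    previous = 0
    for year in range(first_year, last_year + 1):
        if year in recorded:
            previous = recorded[year]
        filled[year] = previous
    return filled


# Hypothetical cumulative registrations for one state (illustrative only).
example = {2003: 10, 2005: 40, 2006: 90}
filled = fill_cumulative(example, first_year=2000, last_year=2008)
for year, total in filled.items():
    print(year, total)
```

With this rule the cumulative series never decreases, so later year-on-year differences stay non-negative as long as the recorded values themselves are non-decreasing.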

Visualisation of EV sales: state-wise and year-wise

New EVs registered per year
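If only cumulative totals are available, yearly new registrations can be recovered as year-on-year differences. The sketch below is one possible way to do this, not necessarily the notebook's method. It assumes that the first year's cumulative value counts entirely as new registrations; `new_per_year` and the sample numbers are hypothetical.

```python
def new_per_year(cumulative):
    """Convert {year: cumulative_total} into {year: new_registrations}.

    Assumption: the first year's cumulative total is treated as that
    year's new registrations (nothing is known about earlier years).
    """
    new = {}
    previous = 0
    for year in sorted(cumulative):
        new[year] = cumulative[year] - previous
        previous = cumulative[year]
    return new


# Hypothetical cumulative totals (illustrative only).
cumulative = {2003: 10, 2004: 10, 2005: 40, 2006: 90}
yearly = new_per_year(cumulative)
print(yearly)
```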

Web scraping to get data for alternative fuel stations in the USA

Data downloaded on 10 August 2022

Visualisation of fuel stations by fuel-type

Web scraping to get data for EV models in Europe

Data retrieved from https://ev-database.org/ on 11 August 2022

I was not able to find a database listing vehicle availability in the United States. For preliminary results, I explored a published database listing 245 EVs in Europe. In the future, this kind of analysis will be extended to EVs in the USA.

Data cleaning

Data Visualisation

Regression models to predict the cost of EVs (using the Europe dataset)

Normal regression: 5-fold cross-validation, repeated 5 times
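As a rough illustration of 5-fold cross-validation repeated 5 times, the sketch below reshuffles the data before each repeat, splits it into 5 folds, and scores a one-feature least-squares line on each held-out fold, giving 25 scores in total. It uses only the standard library so it runs without the notebook's dependencies. The helper names, the mean-absolute-error metric, and the synthetic data are all assumptions, not the notebook's actual model or features.

```python
import random
from statistics import mean


def repeated_kfold(n_samples, n_splits=5, n_repeats=5, seed=0):
    """Yield (train_indices, test_indices), reshuffling before each repeat."""
    rng = random.Random(seed)
    for _ in range(n_repeats):
        indices = list(range(n_samples))
        rng.shuffle(indices)
        for k in range(n_splits):
            test = indices[k::n_splits]  # every n_splits-th shuffled index
            held_out = set(test)
            train = [i for i in indices if i not in held_out]
            yield train, test


def fit_line(xs, ys):
    """Ordinary least squares for y = slope * x + intercept."""
    x_bar, y_bar = mean(xs), mean(ys)
    sxx = sum((x - x_bar) ** 2 for x in xs)
    sxy = sum((x - x_bar) * (y - y_bar) for x, y in zip(xs, ys))
    slope = sxy / sxx
    return slope, y_bar - slope * x_bar


def cross_validate(xs, ys, n_splits=5, n_repeats=5, seed=0):
    """Return one mean absolute error per held-out fold."""
    scores = []
    for train, test in repeated_kfold(len(xs), n_splits, n_repeats, seed):
        slope, intercept = fit_line([xs[i] for i in train],
                                    [ys[i] for i in train])
        errors = [abs(ys[i] - (slope * xs[i] + intercept)) for i in test]
        scores.append(mean(errors))
    return scores


# Synthetic, perfectly linear data (placeholder for a real feature and price).
xs = [float(i) for i in range(20)]
ys = [2.0 * x + 1.0 for x in xs]
scores = cross_validate(xs, ys)
print(len(scores), mean(scores))
```

Averaging the 25 scores, rather than relying on a single split, reduces the dependence of the reported error on one particular partition of a small dataset.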

Ensemble regression: 5-fold cross-validation, repeated 5 times